Optimal Replication Strategy Using Rough Set Theory for Dynamic Content Replication
Abstract
A replication strategy minimizes recovery time and data loss in an operational system and aims to reduce system load, access latency and network congestion in a distributed system. Replication improves the reliability and scalability of services. To increase availability, we divide large files into small, medium and large fragments. Data that receives more hits, or that takes longer to access, needs to be replicated more. However, these methods incur high overhead from unnecessary file replication, fragmentation and consistency maintenance. We therefore place data on reliable servers. Any distributed database has three desirable properties: consistency, availability and partition tolerance (CAP). In practice, at most two of these properties can be satisfied at once for any shared data. A mathematical method called Rough Set Theory (RST), which deals with vagueness and uncertainty in data and decision making, can therefore be used. When the information available in the database is insufficient to describe a given set exactly, lower and upper approximations can be used to bound it. Rough sets are created based on attributes, and memory is allocated for the sets. Based on ranking, we decide whether a set is of fixed or variable size, as we try to maintain time-consistent data. The objective of this paper is to determine optimal data replicas using the Rough Set Theory methodology, which groups data into sets of different granule sizes and generates a set of decision rules. This helps determine which data sets should be replicated more in order to provide better availability and consistent data.
Keywords: replication strategy, rough set theory, consistency of data
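The lower and upper approximations mentioned in the abstract can be illustrated with a minimal sketch. It is not the paper's method: the attribute names (`hits`, `delay`), the file identifiers and the target set `should_replicate` are hypothetical values chosen only to show how rough set approximations are computed from an attribute table.

```python
from collections import defaultdict

def equivalence_classes(objects, attributes):
    """Group object ids that are indiscernible on the given attributes."""
    classes = defaultdict(set)
    for obj_id, values in objects.items():
        key = tuple(values[a] for a in attributes)
        classes[key].add(obj_id)
    return list(classes.values())

def approximations(objects, attributes, target):
    """Return the (lower, upper) approximations of `target`."""
    lower, upper = set(), set()
    for cls in equivalence_classes(objects, attributes):
        if cls <= target:   # class lies entirely inside the target set
            lower |= cls
        if cls & target:    # class overlaps the target set
            upper |= cls
    return lower, upper

# Hypothetical attribute table: one row per file (assumed data).
files = {
    "f1": {"hits": "high", "delay": "high"},
    "f2": {"hits": "high", "delay": "high"},
    "f3": {"hits": "high", "delay": "low"},
    "f4": {"hits": "low", "delay": "low"},
    "f5": {"hits": "high", "delay": "low"},
}
# Hypothetical set of files an operator decided to replicate more.
should_replicate = {"f1", "f2", "f3"}

lower, upper = approximations(files, ["hits", "delay"], should_replicate)
boundary = upper - lower
print(sorted(lower), sorted(upper), sorted(boundary))
```

Files in the lower approximation certainly belong to the target set given the chosen attributes, files in the upper approximation possibly belong to it, and the boundary holds the cases the attributes cannot decide.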
Similar resources
Improving Data Grids Performance by Using Modified Dynamic Hierarchical Replication Strategy
Abstract: A Data Grid connects a collection of geographically distributed computational and storage resources, enabling users to share data and other resources. Data replication, a technique much discussed by Data Grid researchers in recent years, creates multiple copies of a file and places them in various locations to shorten file access times. In this paper, a dynamic data replication strate...
Improve Replica Placement in Content Distribution Networks with Hybrid Technique
The increasing use of the Internet and its accelerated growth reduce the available network bandwidth and server capacity; as a result, the quality of Internet services becomes unacceptable to users, even though efficient and effective delivery of web content plays an important role in improving performance. Content distribution networks were introduced to address this issue. Replicatin...
Continuous time portfolio optimization
This paper presents a dynamic portfolio model based on Merton's optimal investment-consumption model, which combines a dynamic synthetic put option using risk-free and risky assets. This paper is an extended version of a methodological paper published by Yuan Yao (2012). Because of the long history of the development of foreign financial markets, with a variety of financial derivatives, the study on ...
An Efficient Data Replication Strategy in Large-Scale Data Grid Environments Based on Availability and Popularity
Data grid technology, which uses the scale of the Internet to overcome storage limitations for huge amounts of data, has become a hot research topic. Recently, data replication strategies have been widely employed in distributed environments to copy frequently accessed data to suitable sites. The primary purposes are shortening the distance of file transmission and achieving files from ...
Dynamic Replication based on Firefly Algorithm in Data Grid
In a data grid, reservation is an accepted way to provide scheduling and quality of service. Users need access to data stored in a geographical environment, which can be provided by replication, an action taken to reach certainty. As a result, users are directed toward the nearest version to access information. The most important point is to know in which sites and distributed sy...
Publication date: 2015